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1.  Introduction 


'V 

SRI  International  is  engaged  in  a  long-term  research  effort  under  DARPA  sponsorship  to  develop 
techniques  for  facilitating  the  acquisition  of  knowledge  by  computers.  The  ultimate  goal  is  to  build 
KLAUS  (Knowledge-Learning  And  -Using  System),  a  computer  program  that  could  acquire  a  model  of 
a  domain  of  interest  by  being  instructed  in  English.  The  model  could  then  be  used  for  domain-related 
tasks.  Systems  based  on  the  KLAUS  concept  would  be  useful  for  a  variety  of  applications,  including 
sophisticated  interfaces  to  computer  software,  advanced  database  systems,  and  intelligent  computer 
assistants. 

'  This  report  covers  work  done  on  the  KLAUS  project  during  the  period  July  1981  to  January 
1982.  Section  2  describes  DIALOGIC,  the  natural-language  processing  system  used  in  MICRO  KLAUS, 
which  is  our  current  preliminary  implementation  of  the  KLAUS  concept.  The  DIALOGIC  system 
translates  English  sentences  into  representations  of  their  literal  meaning  in  the  context  of  an  utterance. 
These  representations,  or  logical  fornu,  are  intended  to  be  a  purely  formal  language  that  is  as  dose  as 
possible  to  the  structure  of  natural  language,  while  providing  the  semantic  compositionality  necessary 
for  meaning-dependent  computational  processing.  The  design  of  DIALOGIC  (and  of  its  constituent 
modules)  was  influenced  by  the  goal  of  using  it  as  a  core  language-processing  component  in  a  variety  of 
systems,  some  of  which  are  transportable  to  new  domains  of  application. 


Section  3  describes  a  new  scheme  for  syntax-directed  translation  of  natural  language  into 
logical  form.  That  scheme,  comprising  the  basis  of  an  English  translation  system  called  PATR,  was 
used  to  specify  a  semantically  interesting  fragment  of  English  that  included  such  constructs  as  tense, 
aspect,  modals,  and  various  lexically  controlled  verb  complement  structures.  PATR  was  embedded  in  a 
question-answering  system  that  replied  appropriately  to  questions  requiring^e  computation  of  logical 
entailments. 


Section  4  describes  a  new  theorem-proving  program  developed  for  KLAUS.  It  combines  the  use 
of  nonclausa!  resolution  and  connection  graphs.  The  use  of  nonclausal  resolution  as  the  inference  system 
eliminates  some  of  the  redundancy  and  unreadability  of  clause-based  systems.  The  use  of  a  connection 
graph  restricts  the  search  space  and  facilitates  graph  searching  for  efficient  deduction.  This  theorem- 


proving  program  is  used  as  the  deduction  component  of  both  MICROKLAUS  (of  which  DIALOGIC, 
described  in  Section  2,  is  the  natural-laagu  age-processing  component)  and  the  PATR-baaed  question¬ 
answering  system  described  in  Section  3. 


» 
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2.  DIALOGIC:  A  Core  Natural-Language-Processing  System 


This  section  wo*  written  by  Barbara  Grosz,  Norman  Hoot,  Gary  Hendrix,  Jerry  Hobbs,  Paid 
Martin,  Robert  Moore,  Jane  Robinson,  and  Stanley  Rosenschein.  Norman  Haas  and  Gary  Hendrix  are 
now  with  Symantec  in  Sunnyvale,  California. 


2.1.  Overview 

The  DIALOGIC  system  translates  English  sentences  into  representations  of  their  literal  meaning 
in  the  context  of  an  utterance.  These  representations,  or  logical  forms,  are  intended  to  be  a  purely  formal 
language  that  is  as  close  as  possible  to  the  structure  of  natural  language,  while  providing  the  semantic 
compositionality  necessary  for  meaning-dependent  computational  processing.  The  design  of  DIALOGIC 
(and  of  its  constituent  modules)  was  influenced  by  the  goal  of  using  it  as  the  core  language-processing 
component  in  a  variety  of  systems,  some  of  which  are  transportable  to  new  domains  of  application. 

Currently  DIALOGIC  is  a  core  component  of  KLAUS  and  three  other  systems  being  developed 
in  several  different  research  projects  at  SRI.  One  is  the  TEAM  project,1  whose  goal  is  to  provide  natural- 
language  access  to  large  databases  through  systems  that  are  easily  adaptable  to  a  wide  range  of  new 
applications.  A  second  project  is  investigating  the  problem  of  providing  natural-language  access  to  text.2 
A  third,  in  which  DIALOGIC  also  plays  an  important  role,  is  examining  the  development  of  formal 
grammars.3 

DIALOGIC  is  divided  into  five  modules  coordinated  by  the  DIAMOND  executive  system. 
DIAMOND  is  a  modification  of  the  executive  system  used  in  the  SRI  speech-understanding  project 
[Wa78]  as  well  as  in  a  task-dialogue  interpretation  system  (R08O).  It  provides  the  formal  language  for 
defining  the  grammar  and  the  control  for  parsing  English  sentences  and  translating  them  into  logical- 
form  expressions. 

Sponsored  by  the  Defenie  Advanced  Research  Project*  Agency  under  Contract  N000SS-8O-C-064S. 

2Funded  by  the  Nations)  Library  of  Medicine  nnder  Grant  IROl-LMOSSU. 

Sponsored  by  the  National  Science  Foundation  under  Grant  IST-810SS50. 
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The  five  modules  are  (1)  the  DIAGRAM  grammar;  (2)  a  set  of  semantic  translators;  (3)  a  set 
of  basic  semantic  functions;  (4)  a  scoping  algorithm  (for  quantifiers  and  sentence  operators);  (5)  a  set 
of  basic  pragmatic  functions.  The  remainder  of  this  section  describes  these  components  of  DIALOGIC 
and  presents  an  example  illustrating  how  they  coordinate  with  one  another  in  the  interpretation  of  an 
utterance.  A  description  of  the  logical  form  that  is  the  target  of  DIALOGIC’s  interpretation  processes 
may  be  found  in  |Mo8l]. 

2.2.  DIAGRAM 

DIAGRAM  is  a  general  grammar  of  English.  It  now  contains  about  125  rule  schemata,  equiv¬ 
alent  to  about  800  individual  rules.  These  define  all  the  common  sentence  types,  complex  auxiliaries 
and  modals,  complex  noun  phrases,  nominalized  sentences,  all  the  common  quantifiers,  relative  clauses, 
verbs  with  sentential  complements,  comparative  and  measure  expressions,  subordinate  clauses  and  other 
adverbial  modifiers.  Conjunction,  however,  is  limited  to  a  few  placeholders,  pending  further  study  of  the 
problems  it  poses  for  constraining  the  number  of  syntactic  analyses.  A  detailed  description  of  DIAGRAM 
is  contained  in  [Ro82j.  In  a  formal  sense,  DIAGRAM  is  an  augmented  phrase-structure  grammar.  The 
lexicon  categorizes  words  and  associates  attributes  with  them  that  are  used  in  the  rules.  Each  rule  has 
associated  with  it  a  constructor  that  expresses  the  constraints  on  its  application  and  also  a  translator 
(described  in  the  next  subsection)  that  produces  the  corresponding  logical  form. 

Phrases  inherit  attributes  from  their  constituents  and  acquire  attributes  from  the  larger  phrases 
that  contain  them.  These  attributes  are  used  to  impose  context-sensitive  constraints  upon  the  acceptance 
of  an  analysis.  Before  constructing  a  node  in  the  parse  tree  corresponding  to  the  application  of  a  rule, 
the  executive  invokes  the  rule’s  constructor  to  test  for  admissibility.  In  addition  to  accepting  or  rejecting 
a  rule  application,  the  constructors  can  assign  scores  that  allow  the  listing  of  alternative  analyses  in  a 
preferred  order.  The  result  of  applying  the  grammar  to  analysis  of  an  input  is  one  or  more  annotated 
parse  trees. 

Attributes  and  annotations  are  not  limited  to  syntactic  information.  The  translators,  described 
next,  specify  how  the  translation  of  a  phrase  into  logical  form  is  to  be  defined  in  terms  of  the  attributes 
of  the  words  and  phrases  that  compose  it.  This  coupling  of  syntax  and  semantics  (for  which  attribute 


grammars  [Ti80j  were  originally  designed)  is  convergent  with  current  formal  theories  of  natural  language 
that  advocate  constructing  a  syntax  and  semantics  that  “work  in  tandem"  [DW81,  KB-,  Ga80,  La76.{ 

Future  work  on  DIAGRAM  will  include  efforts  to  extend  both  its  coverage  and  its  formalism. 
In  extending  the  formalism,  our  dual  objective  is  to  capture  certain  linguistic  generalizations  (e.g.,  dative 
movement)  and  to  make  the  task  of  developing  a  large  grammar  more  manageable.  To  accomplish  this, 
we  are  exploring  the  use  of  metarules  [Ga80|. 

2.3.  lYanslators 

Following  the  syntactic  analysis  of  an  utterance,  a  sequence  of  semantic  translators  is  invoked 
to  build  the  logical  form  that  corresponds  to  a  literal  interpretation  of  the  utterance  in  context.  The 
translator  for  each  phrase-structure  rule  specifies  how  the  various  constituents  of  the  phrase  are  to  be 
combined  to  form  an  interpretation  of  the  whole  phrase.  It  prescribes  the  predicate-argument  structures 
that  correspond  to  the  grammatical  construction  or,  more  generally,  the  operator-operand  structures. 

Although  the  translators  operate  top-down  (the  translator  for  each  node  invokes  the  translators 
for  its  children),  the  translation  is  in  effect  built  bottom-up — since,  typically,  the  first  thing  a  translator 
for  a  nonterminal  node  does  is  to  invoke  the  translators  for  each  of  its  constituents,  usually  left  to  right. 
However,  the  top-down  nature  of  the  translation  process  is  significant,  because  it  means  that  information 
located  above  a  node  and  to  its  left  is  available  when  the  node  is  translated.  In  addition  to  producing 
the  logical  form,  the  translators  determine  the  syntactic  constraints  upon  and  preferences  for  either 
coreference  or  noncoreference  of  noun  phrases,  especially  pronouns,  in  accordance  with  an  algorithm 
described  in  (Ho76|. 

2.4.  Basic  Semantic  Functions 

To  insulate  changes  in  the  grammar  from  those  that  occur  in  logical  form,  the  construction  of 
the  latter  is  isolated  from  the  translator  procedures  by  calls  on  basic  semantic  functions  [Ko79]).  The 
actual  construction  of  a  logical  form  is  done  in  two  phases:  (1)  logical-form  fragments  (Ilfs)  are  attached 
to  the  parse  tree  by  the  basic  semantic  functions;  (2)  the  final  logical  form  is  assembled  from  these  by 
the  scoping  algorithm. 


Lffs  are  assigned  only  to  certain  nodes  in  the  parse  tree.  Usually  the  iff  at  an  NP  node  will  encode 
the  properties  held  by  the  entity  the  NP  describes  [e.g.,  “X  such  that  EMPLOYGE(X)  &  OLD(X)”  for 
“old  employee”!  an<*  the  fragment  for  a  clause-level  construction  (e.g.,  a  VP)  will  encode  the  predicate- 
argument  structure  of  the  clause. 

The  basic  semantic  functions  also  leave  markers  on  the  parse  tree  to  indicate  such  things  as  the 
type  of  quantifier  or  determiner  associated  with  a  noun  phrase.  These  markers  are  used  by  the  scoping 
algorithm  to  determine  the  final  logical  form  for  the  utterance^  (Note  that  the  lffs  and  markers  left  by 
the  basic  semantic  functions  may  be  viewed  as  further  annotations  to  the  parse  tree.) 

DIALOGIC  currently  includes  eleven  basic  semantic  functions.  Six  of  these  do  most  of  the  work 
of  building  lffs  for  standard  noun  phrases  and  clauses.  The  others  are  concerned  with  adding  such  things 
as  mode,  degree,  and  adverbial  modification  to  clauses.  As  more  precise  specifications  are  defined  for 
encoding  those  phenomena  in  logical  form,  we  expect  to  eliminate  some  of  these  basic  semantic  functions. 

2.5.  Scoping  of  Quantifiers  and  Other  Sentential  Operators 

The  scoping  algorithm  is  designed  to  collect  the  logical-form  fragments  from  the  parse  tree  and 
produce  the  possible  scopings  of  quantifiers  and  other  scoped  operators.  The  scoping  algorithm  used  in 
DIALOGIC  (adapted  from  that  in  (He78|)  produces  all  the  scopings  that  do  not  violate  the  hard  rules 
of  English  scoping,  and  then  ranks  them  according  to  a  score  computed  by  a  set  of  specialist  critics. 
Each  critic  is  a  function  that  returns  a  score  for  some  aspect  of  the  conflicting  rules  of  quantification 
in  English;  e.g.,  the  left-right  scope  critic  lowers  the  score  of  scopinp  that  involve  permuting  the  left- 
outermost  default  ordering  of  quantifiers.  All  critics  receive  equal  weight  in  the  present  implementation, 
but  the  design  of  the  system  does  allow  for  differential  weighting. 

The  current  set  of  critics  is  concerned  with  such  things  as  changes  in  sentence  order  and  the 
relative  scoping  of  quantifiers  of  different  strengths.  The  scoping  of  nonstandard  quantifiers  and  of  the 
generalized  negative  (“not,”  “no  one,”  “nothing,”  “none”)  remains  to  be  done. 

2.6.  Basic  Pragmatic  Functions 

Basic  pragmatic  functions  are  intended  to  fulfill  several  roles  in  DIALOGIC,  all  concerned  with 


certain  kinds  of  indeterminacies  in  logical  form  whose  resolution  requires  pragmatic  information.  The 
four  primary  uses  of  basic  pragmatic  functions  in  the  current  system  are  (1)  to  provide  a  context-specific 
interpretation  of  certain  terms  that  have  only  vague  meanings  in  themselves  (e.g.,  prepositions  like  “of” 
and  “in,”  or  inherently  vague  verbs  like  “have”);  (2)  to  establish  the  specific  relationship  underlying  any 
given  noun-noun  combination;  (3)  to  identify  the  referents  of  pronouns;  and  (4)  to  interpret  a  limited 
range  of  metonymy  (e.g.,  the  use  of  “blonds”  to  mean  “people  with  blond  hair”).  At  present,  only  a 
small  core  of  pragmatic  functions  is  implemented,  each  of  which  handles  but  a  subset  of  the  cases  it  is 
intended  to  cover. 


2.7.  Example 

To  illustrate  how  the  different  modules  of  DIALOGIC  contribute  to  the  interpretation  of  an 
utterance,  we  shall  consider  the  example 

fhat  SRI  enployeea  have  children  older  than  IS  years? 

The  logical  form  for  this  query — the  target  of  the  interpretation  processes— is  (lowercase  is  used  to 

indicate  variables,  uppercase  to  indicate  constants  and  predicates): 

[QUERY  (VH  enployeel  (AID  (EMPLOYEE  eaployeel) 

(EMPLOYEES-COMPART-PF  eaployeel  SRI)) 

(SOKE  ckildS  (CHILD  child?)  (AID  (CHILD-OF  eaployeel  ckild2) 

((•KOREe  OLD)  child2  (YEAR  IS] 

This  corresponds  roughly  to  a  formal  representation  for  “Who  is  each  employee  such  that  the  company 
of  the  employee  is  SRI  and  some  child  of  the  employee  is  older  than  fifteen  years?” 

During  DIAMOND’S  parsing  phase,  the  parse  tree  in  Figure  1  is  constructed.  At  this  point, 
the  attributes  annotating  the  tree  encode  such  properties  as  the  type  of  noun  (count,  mass,  unit)  and 
the  syntactic  number.  These  attributes  have  been  used  during  the  parsing  phase  to  rule  out  certain 
alternative  structures. 

Once  this  structure  is  built,  the  translators  are  invoked.  In  combination  with  the  bask  semantk 
functions,  the  translators  assign  additional  attributes  to  nodes  in  the  tree,  encoding  such  information 
as  the  quantifiers  (type,  strength,  and  the  variables  they  bind)  and  the  heads  of  noun  phrases.  For 
example,  the  head  of  the  WHNP,  “what  SRI  employees”,  is  a  variable  of  type  EMPLOYEE  that  is 
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bound  by  a  wh-type  quantifier.  Attributes  also  encode  the  underlying  predicate-argument  structures  for 
verb  phrases  and  adjectives  as  well  as  the  lffs  to  be  used  in  constructing  the  final  logical  form  for  the 
utterance. 

In  the  sentence  of  Figure  1,  the  nodes  WHNP  and  S  are  annotated  as  being  quantified,  WHNP 
with  a  wh-type  quantifier  and  S  as  a  “query.”  Although  every  rule  has  an  associated  translator,  only 
some  of  these  result  in  lffs  being  attached  to  nodes.  For  this  example,  the  nodes  marked  with  **  in  the 
original  parse  tree  are  the  only  ones  for  which  lffs  are  produced. 

The  fragment  attached  to  each  of  these  nodes  is  as  follows: 

NOUNI  (*NN*  emplcyeel  SRI) 

NOUN2  (EMPLOYEE  employee!) 

PREDICATE  (*HAVE  employed  child2) 

NOUN3  (CHILD  child2) 

NCOMP  (('MORE*  OLD)  child2  (YEAR  15)) 

EMPLOYEE,  CHILD,  and  OLD  are  unary  predicates  that  are  part  of  the  conceptual  model  of  the 
domain.  *MORE*  maps  a  predicate  into  a  comparative  along  the  scale  corresponding  to  the  predicate. 
*NN*  and  “HAVE  are  dummy  predicates  that  indicate  the  need  to  invoke  the  basic  pragmatic  functions. 

After  the  translation  process  is  complete,  the  final  logical  form  is  assembled  by  a  procedure  that 
considers  alternative  quantifier  scopings  (using  the  quantifier-related  annotations  left  on  the  parse  tree) 
and  invokes  the  basic  pragmatic  functions  as  needed.  The  basic  pragmatic  functions  use  information  in 
the  conceptual  model  of  the  domain  to  transform  (*NN*  employed  SRI) — corresponding  to  the  noun¬ 
noun  compound  “SRI  employee” — into  (EMPLOYEE-OF  employed  SRI)  and  (*HAVE  employed  child2) 
into  (CHILD-OF  employed  child2). 

The  nodes  with  either  quantifier  or  logical-form  markings  are  the  only  ones  considered  by  the 
TEAM  scoping  algorithm.  Besides  the  WH  quantifying  employeel,  TEAM  recognizes  that  a  default 
existential  quantifier  must  be  created  for  child2,  so  SOME  is  added.  The  scope  rules  force  QUERY  to 
have  the  widest  scope;  this  position  is  contested  only  if  there  are  multiple  sentential  markers.  Both 
orderings  of  the  WH  and  SOME  quantifiers  are  generated.  The  two  resulting  quantified  statements 
correspond  to  (WH  employed  (SOME  child  ...)...)  and  (SOME  child  (WH  employee  ...)...) 

Next  the  scope  critic  functions  evaluate  the  different  scopings;  only  three  of  the  critics  are 


8 


relevant.  One  critic  considers  the  left-right  node  ordering  and  prefers  the  first  scoping  because  it  comes 
closer  to  the  surface  form.  One  critic  prefers  scopings  in  which  WH  outscopes  an  adjacent  existential;  it 
too  upgrades  the  score  of  the  first  and  downgrades  the  score  of  the  second.  The  other  critic  knows  that 
default  existential  quantifiers  need  the  narrowest  possible  scope;  it  too  selects  the  first  scoping. 

2.8.  Summary 

Because  of  the  modularization  in  DIALOGIC,  changes  in  one  part  of  the  system  reverberate 
very  little  in  other  components.  Changes  in  the  constraints  imposed  on  the  phrase-structure  rules  in 
the  grammar  have  no  effect  on  any  other  part  of  the  system.  A  change  in  a  rule  itself  necessitates  a 
change  in  the  corresponding  translator,  but  the  basic  semantic  functions  do  not  need  to  be  revised. 
Similarly,  a  change  in  the  logical  form  or  in  the  data  structures  within  which  it  is  implemented  requires 
a  corresponding  change  in  the  basic  semantic  functions,  but  not  in  the  grammar  or  translators. 

In  addition  to  extending  DIALOGIC  as  mentioned  in  the  foregoing  sections,  we  are  also 
investigating  possible  revisions  of  the  translation  phase  (as  currently  performed  by  the  translators  and 
basic  semantic  functions)  to  allow  translation  into  logical  form  to  be  specified  declarative^.  In  this  new 
approach  (described  in  the  following  section),  logical  types  are  associated  with  phrasal  categories,  and 
the  translation  of  a  phrase  is  synthesized  from  the  translations  of  its  immediate  constituents  according 
to  a  local  rule,  which  typically  involves  functional  application. 
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Figure  1.  Parse  Tree  for  “What  SRI  employees  have  children  older  than  15  years?" 


3.  Translating  English  into  Logical  Form 


This  section  was  written  by  Stanley  J.  Rosensckein  and  Stuart  M.  Skieber. 

3.1.  Introduction 

When  contemporary  linguists  and  philosophers  speak  of  “semantics,”  they  usually  mean  model* 
theoretic  semantics — mathematical  devices  for  associating  truth  conditions  with  sentences.  Computational 
linguists,  on  the  other  hand,  often  use  the  term  “semantics”  to  denote  a  phase  of  processing  in  which 
a  data  structure  (e.g.,  a  formula  or  network)  is  constructed  to  represent  the  meaning  of  a  sentence  and 
serve  as  input  to  later  phases  of  processing.  (A  better  name  for  this  process  might  be  “translation” 
or  “transduction.”)  Whether  one  takes  “semantics”  to  be  about  model  theory-  or  translation,  the  fact 
remains  that  natural  languages  are  marked  by  a  wealth  of  complex  constructions — such  as  tense,  aspect, 
moods,  plurals,  modality,  adverbials,  degree  terms,  and  sentential  complements — that  make  semantic 
specification  a  complex  and  challenging  endeavor. 

Computer  scientists  faced  with  the  problem  of  managing  software  complexity  have  developed 
strict  design  disciplines  in  their  programming  methodologies.  One  might  speculate  that  a  similar 
requirement  for  manageability  has  led  linguists  (since  Montague,  at  least)  to  follow  a  discipline  of  strict 
compositionality  in  semantic  specification,  even  though  model-theoretic  semantics  per  se  does  not  demand 
it.  Compositionality  requires  that  the  meaning  of  a  phrase  be  a  function  of  the  meanings  of  its  immediate 
constituents,  a  property  that  allows  the  grammar  writer  to  correlate  syntax  and  semantics  on  a  rule-by¬ 
rule  basis  and  keep  the  specification  modular.  Clearly,  the  natural  analogue  to  compositionality  in  the 
case  of  translation  is  syntax-directed  translation;  it  is  this  analogy  that  we  seek  to  exploit. 

We  describe  a  syntax-directed  translation  scheme  that  bears  a  close  resemblance  to  model- 
theoretic  approaches  and  achieves  a  level  of  perspicuity  suitable  for  the  development  of  large  and  complex 
grammars  by  using  a  declarative  format  for  specifying  grammar  rules.  In  our  formalism,  translation 
types  are  associated  with  the  phrasal  categories  of  English  in  much  the  way  that  logical-denotation  types 
are  associated  with  phrasal  categories  in  model-theoretic  semantics.  The  translation  types  are  classes 


of  data  object s  ratber  than  abstract  denotation t,  yet  they  play  much  the  same  role  in  the  translation 
process  that  denotation  types  play  in  formal  semantics. 

In  addition  to  this  parallel  between  logical  types  and  translation  types,  we  have  intentionally 
designed  the  language  in  which  translation  rules  are  stated  to  emphasize  parallels  between  the  syntax* 
directed  translation  and  corresponding  model-theoretic  interpretation  roles  found  in,  say,  the  GPSG 
literature  [Ga80j.  In  the  GPSG  approach,  each  syntax  rule  has  an  associated  semantic  rule  (typically 
involving  functional  application)  that  specifies  how  to  compose  the  meaning  of  a  phrase  from  the 
meanings  of  its  constituents.  In  an  analogous  fashion,  we  provide  for  the  translation  of  a  phrase  to 
be  synthesized  from  the  translations  of  its  immediate  constituents  according  to  a  local  rule,  typically 
involving  symbolic  application  and  X-conversion. 

It  should  be  noted  in  passing  that  doing  translation,  rather  than  model-theoretic  interpretation, 
offers  the  temptation  to  abuse  the  formalism  by  having  the  “meaning”  (translation)  of  a  phrase  depend 
on  syntactic  properties  of  the  translations  of  its  constituents — for  instance,  on  the  order  of  conjuncts  in 
a  logical  expression.  There  are  several  points  to  be  made  in  this  regard.  First,  without  severe  a  priori 
restrictions  on  what  kinds  of  objects  can  be  translations  (coupled  with  the  associated  strong  theoretical 
claims  that  such  restrictions  would  embody),  it  seems  impossible  to  prevent  such  abuses.  Second,  as  in 
the  case  of  programming  languages,  it  is  reasonable  to  assume  that  there  would  emerge  a  set  of  stylistic 
practices  that,  for  reasons  of  manageability  and  esthetics,  would  govern  the  actual  form  of  grammars. 
Third,  it  is  still  an  open  question  whether  the  model-theoretic  program  of  strong  composition ality  will 
actually  succeed.  Indeed,  whether  it  succeeds  or  not  is  of  little  concern  to  the  computational  linguist, 
whose  systems,  in  any  event,  have  no  direct  way  of  using  the  sort  of  abstract  model  being  proposed  and 
must  generally  be  based  on  deduction  (and  hence  translation). 

The  rest  of  this  section  discusses  our  work  in  more  detail.  Section  3.2  presents  the  grammar 
formalism  and  describes  PATR,  an  implemented  parsing  and  translation  system  that  can  accept  a 
grammar  in  our  formalism  and  use  it  to  process  sentences.  Examples  of  the  system’s  operation,  including 
its  application  in  a  simple  deductive  question-answering  system,  are  found  in  Section  3.3.  Finally,  Section 
3.-1  describes  further  extensions  of  the  formalism  and  the  parsing  system.  Three  appendices  are  included: 
the  first  contains  sample  grammar  rules;  the  second  contains  meaning  postulates  (axioms)  used  by  the 
question-answering  system;  the  third  presents  a  sample  dialogue  session. 
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3.2.  A  Grammar  Formalism 


3.2.1  General  Characterization 

Our  grammar  formalism  is  best  characterised  as  a  specialised  type  of  augmented  context-free 
grammar.  That  is,  we  take  a  grammar  to  be  a  set  of  context-free  rules  that  define  a  language  and,  in 
the  usual  way,  associate  structural  descriptions  (parse  trees)  for  each  sentence  in  that  language.  Nodes 
in  the  parse  tree  are  assumed  to  have  a  set  of  features  that  may  assume  binary  values  (True  or  False), 
and  there  is  a  distinguished  attribute — the  “translation” — whose  values  range  over  a  potentially  infinite 
set  of  objects,  i.e.,  the  translations  of  English  phrases. 

Viewed  more  abstractly,  we  regard  translation  as  a  binary  relation  between  word  sequences 
and  logical  formulas.  The  use  of  a  relation  is  intended  to  incorporate  the  fact  that  many  word  sequences 
have  several  logical  forms,  while  some  have  none  at  all.  Furthermore,  we  view  this  relation  as  being 
composed  (in  the  mathematical  sense)  of  four  simpler  relations  corresponding  to  the  conceptual  phases 
of  analysis:  (1)  LEX  (lexical  analysis),  (2)  PARSE  (parsing),  (3)  ANNOTATE  (assignment  of  attribute 
values,  syntactic  filtering),  and  (4)  TRANSLATE  (translation  proper,  i.e.,  synthesis  of  logical  form). 

The  domains  and  ranges  of  these  relations  are  as  follows: 

Word  Sequences  — J LEX-* 

Morpheme  Sequences  -PARSE-* 

Phrase  Structure  Trees  -ANNOTATE-* 

Annotated  Trees  -TRANSLATE-* 

Logical  Form 

The  relational  composition  of  these  four  relations  is  the  full  translation  relation  associating 
word  sequences  with  logical  forms.  The  subphases  too  are  viewed  as  relations  to  reflect  the  inherent 
nondeterminism  of  each  stage  of  the  process.  For  example,  the  sentence  “a  hat  by  every  designer  sent 
from  Paris  was  felt”  is  easily  seen  to  be  nondeterministic  in  LEIX  (“felt”),  PARSE  (postnominal  modifier 
attachment),  and  TRANSLATE  (quantifier  scoping). 

It  should  be  emphasized  that  the  correspondence  between  processing  phases  and  these  concep¬ 
tual  phases  is  loose.  The  goal  of  the  separation  is  to  make  specification  of  the  process  persp icons  and  to 
allow  simple,  clean  implementations.  An  actual  system  could  achieve  the  net  effect  of  the  various  stages 
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RULES: 

Coutut  COMP’  >  (X  P  (X  a  (X  X  (P  (Q  X))))) 

S  -  IP  ?P 
Trans:  TP’tlP’] 

TP  —  inSE  T 

Auo:  [  -'Transitive  (Y)  ] 

Ir»u:  {  COMP’  [TE1SE*]  [Y’J  } 

LEXICOI: 
lir  -*  John 

laao:  [  Proper(lP)  ] 

Trans:  {  Jobs  } 

TEISE  —  ftpast 
Trans:  {(XX  (paafc  X))  } 

¥  —  go 

Aaao:  [  -'Transitive  (V)  ] 

Trans:  {  (X  X  (go  X))  } 

Figure  1:  Sample  specification  ot  augmented  phrase  structure  grammar 

in  many  ways,  and  numerous  optimizations  could  be  envisioned  that  would  have  the  effect  of  folding 
back  later  phases  to  increase  efficiency. 


3.2.2  The  Relations  LEX,  PARSE,  and  ANNOTATE 

We  now  describe  a  characteristic  form  of  specification  appropriate  to  each  phase,  and  illustrate 
how  the  word  sequence  “John  went”  is  analyzed  by  stages  as  standing  in  the  translation  relation  to 
“(past  (go  John))”  according  to  the  (trivial)  grammar  presented  in  Figure  1. 

Lexical  analysis  is  specified  by  giving  a  kernel  relation  between  individual  words  and  morpheme 

sequences1  (or  equivalently,  a  mapping  from  words  to  sets  of  morpheme  sequences),  for  example: 

John  -»  (John): 

vunfc  -»  (ftpast  go); 

puruaadud  -*  (ftpast  persuade), 

(ftppl  peraiade) ; 

The  kernel  relation  is  extended  in  a  standard  fashion  to  the  full  LEIX  relation.  For  example, 


*Of  count,  more  sophisticated  approaches  to  morphological  analysis  would  seek  to  analyte  the  LEX  relation  mol*  tally. 
See,  for  example,  (KRSlj  and  [KK8l|. 
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“went”  is  mapped  to  tbe  single  morpheme  sequence  (ftpast  go),  and  “John”  is  mapped  to  (john).  Thus, 
by  extension,  “John  went”  is  transformed  to  (John  ftpast  go)  by  the  lexical  analysis  phase. 

Parsing  is  specified  in  the  usual  manner  by  a  context-free  grammar.  Through  utilization  of  the 
context-free  rules  presented  in  the  sample  system  specification  shown  in  Figure  1,  (John  ftpast  go)  is 
transformed  into  the  parse  tree 

(S  (IP  John) 

(TP  (TEISE  ftpast) 

(T  go))) 

Every  node  in  the  parse  tree  has  a  set  of  associated  features.  The  purpose  of  ANNOTATE  is  to 
relate  the  bare  parse  tree  to  one  that  has  been  enhanced  with  attribute  values,  filtering  out  those  that  do 
not  satisfy  stated  syntactic  restrictions.  These  restrictions  are  given  as  Boolean  expressions  associated 
with  the  context-free  rules;  a  tree  is  properly  annotated  only  if  all  the  Boolean  expressions  corresponding 
to  the  rules  used  in  the  analysis  are  simultaneously  true.  Again,  using  the  rules  of  Figure  1, 

(S  (IP  John) 

(TP  (TEISE  ftpast) 

(T  go))) 

is  transformed  into 

(S  (IP:  Proper 
John) 

(TP:  ->  Transitive 
(TEISE  ftpast) 

(T:  -<  Transitive 
go))) 

3.2.3  The  Relation  TRANSLATE 

Logical-form  synthesis  rules  are  specified  as  augments  to  the  context-free  grammar.  There  is  a 
language  whose  expressions  denote  translations  (syntactic  formulas);  an  expression  from  this  language  is 
attached  to  each  context-free  rule  and  serves  to  define  the  composite  translation  at  a  node  in  terms  of 
tbe  translations  of  its  immediate  constituents.  In  the  sample  sentence,  TENSE’  and  V’  (the  translations 
of  TENSE  and  V  respectively)  would  denote  the  X-expressions  specified  in  their  respective  translation 
rules.  VP’  (the  translation  of  the  VP)  is  defined  to  be  the  value  of  (SAP  (SAP  COMP’  TENSE’)  V’), 
where  COMP’  is  a  constant  X-expression  and  SAP  is  the  symbolic- application  operator.  This  works  out 


to  be  (X  X  (past  (go  X))).  Finally,  the  symbolic  application  of  VP*  to  NP’  yields  (past  (go  John)).  (For 
convenience  we  shall  henceforth  use  square  brackets  for  SAP  and  designate  (SAP  a  0)  by  a(dj.) 

Before  describing  the  symbolic-application  operator  in  more  detail,  it  is  necessary  to  explain 
the  exact  nature  of  the  data  objects  serving  as  translations.  At  one  level,  it  is  convenient  to  think  of  the 
translations  as  X-expressions,  since  X -expressions  are  a  convenient  notation  for  specifying  how  fragments 
of  a  translation  are  substituted  into  their  appropriate  operator-operand  positions  in  the  formula  being 
assembled — especially  when  the  composition  rules  follow  the  syntactic  structure  as  encoded  in  the  parse 
tree.  There  are  several  phenomena,  however,  that  require  the  storage  of  more  information  at  a  node 
than  can  be  represented  in  a  X -express ion.  Two  of  the  most  conspicuous  phenonema  of  this  type 

are  quantifier  scoping  and  unbounded  dependencies  (“gaps”). 

Our  approach  to  quantifier  scoping  has  been  to  take  a  version  of  Cooper’s  storage  technique, 
originally  proposed  in  the  context  of  model-theoretic  semantics,  (Co-]  and  adapt  it  to  the  needs  of 
translation.  For  the  time  being,  let  us  take  translations  to  be  ordered  pairs  whose  first  component  (the 
head)  is  an  expression  in  the  target  language,  characteristically  a  X-expression.  The  second  component 
of  the  pair  is  an  object  called  itorage,  a  structured  collection  of  sentential  operators  that  can  be  applied 
to  a  sentence  matrix  in  such  a  way  as  to  introduce  a  quantifier  and  “capture”  a  free  variable  occurring 
in  that  sentence  matrix.2 

For  example,  the  translation  of  “a  happy  man”  might  be  <  m  ,  (X  S  (some  m  (and  (man 
m  )(happy  m))  S))  >  .*  Here  the  head  is  m  (simply  a  free  variable),  and  storage  consists  of  the  X-expression 
(X  S  ...).  If  the  verb  phrase  “sleeps”  were  to  receive  the  translation  <  (X  X  (sleep  X)),  <t>  >  (i.e.,  a  unary 
predicate  as  head  and  no  storage),  then  the  symbolic  application  of  the  verb  phrase  translation  to  the 
noun  phrase  translation  would  compose  the  heads  in  the  usual  way  and  take  the  “union”  of  the  storage 
yielding  <  (sleep  m),  (X  S  (some  m  (and  (man  m)(happy  m))  S))  >. 

We  define  an  operation  called  “pull.s,”  which  has  the  effect  of  “pulling”  the  sentence  operator 
out  of  storage  and  applying  it  to  the  head.  There  is  another  pull  operation,  pull.v,  which  operates 
on  heads  representing  unary  predicates  rather  than  sentence  matrices.  When  pulls  is  applied  in  our 
example,  it  yields  <  (some  m  (and  (man  m)(happy  m))  (sleep  m)),  4>  >,  corresponding  to  the  translation 

2!d  the  sample  grammar  presented  in  Appendix  A,  the  storage-forming  operation  is  notated  mk.mii. 

“Following  [Mo8lj,  a  quantified  expression  is  of  the  form  (f mantifUr,  ea rMU,  rtttritUon,  Mg) 


of  the  clause  “a  happy  man  sleeps.”  Note  that,  in  the  process,  the  free  variable  m  has  been  “captured.” 
In  model-theoretic  semantics  this  capture  would  ordinarily  be  meaningless,  although  one  can  complicate 
the  mathematical  machinery  to  achieve  the  same  effect.  Since  translation  is  fundamentally  a  syntactic 
process,  however,  this  operation  is  well-defined  and  quite  natural. 

To  handle  gaps,  we  enriched  the  translations  with  a  third  component:  a  variable  corresponding 
to  the  gapped  position.  For  example,  the  translation  of  the  relative  clause  “. . .  (that]  the  man  saw* 
would  be  a  triple:  <  (past  (see  X  Y)),  Y,  (X  S  (the  X  (man  X)  S))>,  where  the  second  component,  Y, 
tracks  the  free  variable  corresponding  to  the  gap.  At  the  node  at  which  the  gap  was  to  be  discharged, 
X-abstraction  would  occur  (as  specified  in  the  grammar  by  the  operation  “ungap”),  thereby  producing  the 
unary  predicate  (X  Y  (past  (see  X  Y))),  which  would  ultimately  be  applied  to  the  variable  corresponding 
to  the  head  of  the  noun  phrase. 

It  turns  out  that  triples  consisting  of  <head,  var,  storage >  are  adequate  to  serve  as  translations 
of  a  large  class  of  phrases,  but  that  the  application  operator  needs  to  distinguish  two  subcases  (which 
we  call  type  A  and  type  B  objects).  Until  now  we  have  been  discussing  type  A  objects,  whose  application 
rule  is  given  (roughly)  as 

< hd .var.sto > [ < hd’, var ’,sto’ > J  «■  <(hd  hd’),var  U  var',  sto  U  eto’>  , 

where  one  of  var  or  var’  must  be  null.  In  the  case  of  type  B  objects,  which  are  assigned  primarily  as 
translations  of  determiners,  the  rule  is 

<hd,var,sto>{<hd',var',8U»'>j  —  <var,  var’,  (hd  hd’)  U  sto  U  »to’> 

For  example,  if  the  meaning  of  “every”  is 

every’  =  <(X  P  (X  S  (every  X  (P  X)  S))),  X,  d> 

and  the  meaning  of  “man”  is 

man’  *  <  man,  <t>  > 

then  the  meaning  of  “every  man”  is 

every’|man’|  =*  <  X  ,  tf>,  (X  S  (man  X)  S)>  , 

as  expected. 

Nondeterminism  arises  in  two  ways.  First,  since  pull  operations  can  be  invoked  nondeterm  in  is- 


tically  at  various  nodes  in  the  parse  tree  (as  specified  by  the  grammar),  there  exists  the  possibility  of 
computing  multiple  scopings  for  a  single  context-free  parse  tree.  (See  Section  3.3.2  for  an  example  of 
this  phenomenon.)  In  addition,  the  grammar  writer  can  specify  explicit  nondeterminism  by  associating 
several  distinct  translation  rules  with  a  single  context-free  production.  In  this  case,  he  can  control  the 
application  of  a  translation  schema  by  specifying  for  each  schema  a  guard,  i.e.,  a  Boolean  combination 
of  features  that  the  nodes  analyzed  by  the  production  must  satisfy  for  the  translation  schema  to  be 
applicable. 


3.2.4  Implementation  of  a  Translation  System 

The  techniques  presented  in  Sections  3.2.2  and  3.2.3  were  implemented  in  a  parsing  and 
translation  system  called  PATR,  which  was  used  as  a  component  in  a  dialogue  system  discussed  in 
Section  3.3.3.  The  input  to  the  system  is  a  sentence,  which  is  preprocessed  by  a  lexical  analyzer.  Parsing 
is  performed  by  a  simple  recursive  descent  parser,  augmented  to  add  annotations  to  the  nodes  of  the 
parse  tree.  Translation  is  then  done  in  a  separate  pass  over  the  annotated  parse  tree.  Thus,  the  four 
conceptual  phases  are  implemented  as  three  actual  processing  phases.  This  folding  of  two  phases  into  one 
was  done  purely  for  reasons  of  efficiency  and  has  no  effect  on  the  actual  results  obtained  by  the  system. 
Functions  to  perform  the  storage  manipulation,  gap  handling,  and  the  other  features  of  translation 
presented  earlier  have  all  been  realized  in  the  translation  component  of  the  running  system.  The  next 
section  describes  an  actual  grammar  that  has  been  used  in  conjunction  with  this  translation  system. 

3.3.  Experiments  in  Producing  and  Using  Logical  Form 


3.3. 1  A  Working  Grammar 

To  illustrate  the  ease  with  which  diverse  semantic  features  could  be  handled,  a  grammar  was 
written  that  defines  a  semantically  interesting  fragment  of  English  along  with  its  translation  into  logical 
form  (Mo81|.  The  grammar  for  the  fragment  illustrated  in  this  dialogue  is  compact,  occupying  only 
a  few  pages,  yet  it  gives  both  syntax  and  semantics  for  modals,  tense,  aspect,  passives,  and  lexically 
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controlled  infinitival  complements.  (A  portion  of  the  grammar  is  included  as  Appendix  A.)4  The  full  test 
grammar,  loosely  based  on  DIAGRAM  |RoS2]  but  restricted  and  modified  to  reflect  changes  in  approach, 
was  the  grammar  used  to  specify  the  translations  of  the  sentences  in  the  sample  dialogue  of  Appendix 

C. 

3.3.2  An  Example  of  the  System’s  Operation 

The  grammar  presented  in  Appendix  A  encodes  a  relation  between  sentences  and  logical-form 
expressions.  We  now  present  a  sample  of  this  relation,  as  well  as  its  derivation,  with  a  sample  sentence: 
“Every  man  persuaded  a  woman  to  go.” 

Lexical  analysis  relates  the  sample  sentence  to  two  morpheme  streams: 

•  every  man  &ppl  persuade  a  woman  l»  go 

•  every  man  &past  persuade  a  woman  > 

The  first  is  immediately  eliminated  because  there  is  no  context-free  parse  for  it  in  the  grammar.  The 

second,  however,  is  parsed  as 

[S  (SDEC  (IP  (SET?  (DDET  (DET  every))) 

(■OK  (I0MHD  (1001  (I  ana))))) 

(PREDICATE  (A DIP  (TEISE  A past)) 

(fPP  (TP  (TPT  (T  persuade))) 

(IP  (DETP  (A  a» 

(101  (I01HD  (1001  (I  voaan))))) 

(IIFIIITITE  (TO  to) 

(TPP  (TP  (TPT  (T  go] 

While  parsing  is  being  done,  annotations  are  added  to  each  node  of  the  parse  tree.  For  instance, 
the  NP  -*  DETP  NOM  rule  includes  the  annotation  rule  AGREE(  NP,  DETP,  Definite  ).  AGREE  is  one 
of  a  set  of  macros  defined  for  the  convenience  of  the  grammar  writer.  This  particular  macro  invocation 
is  equivalent  to  the  Boolean  expression  Definite(NP)  «*  Definite(DETP).  Since  the  DETP  node  itself  has 
the  annotation  Definite  as  a  result  of  the  preceding  annotation  process,  the  NP  node  now  acquires  the 

4S'mee  (hi*  is  ju«t  a  smalt  portion  of  tho  actual  grammar  (elected  for  expository  purposes,  many  of  the  phrasal  categories 
and  annotations  will  seem  unmotivated  and  needlessly  complex.  These  categories  and  annotations  err  utilised  elsewhere 
in  the  test  grammar. 


annotation  Definite  as  well.  At  the  bottom  level,  the  Definite  annotation  was  derived  from  the  lexical 

entry  for  the  word  “every.”5  The  whole  parse  tree  receives  the  following  annotation: 

[S  (SDEC  (IP:  Definite 

(DETP:  Definite 
(DDE? :  DeSnite 
(DEI:  DeSnite 
•very))) 

(■OH  (I0MHD  (1001  (I  mlb))))) 

(PREDICATE  (A DIP  (TEISE  ipast)) 

(TP?  (TP:  Active 

(TPT:  Active, Transitive, Takeslnf 
(T:  Active,  Transitive,  Takeslnf 
psnuds))) 

(IP  (DETP  U  »)) 

(10M  (I0MHD  (10HI  (I  vobbb))))) 

(IIFI1ITITE  (TO  to) 

(TPP  (TP:  Active 

(TPT:  Active 
(T:  Active 
go] 

Finally,  the  entire  annotated  parse  tree  is  traversed  to  assign  translations  to  the  nodes  through 

a  direct  implementation  of  the  process  described  in  Section  3.2.3.  (Type  A  and  B  objects  in  the  following 

examples  are  marked  with  a  prefix  ‘A:’  or  ‘B:’.)  For  instance,  the  VP  node  covering  (persuade  a  woman 

to  go),  has  the  translation  rule  VPT’(NP’](INFINrnVE'|.  When  this  is  applied  to  the  translations  of 

the  node’s  constituents,  we  have 

<A:  (X  X  (X  P  (X  T  (persuade  T  X  (P  X)))> 

[< A :  X2,  <)>,  (X  S  (non*  X2  (vonan  X2)  S))>] 

[<A:  (X  X  (go  X))>] 

which,  after  the  appropriate  applications  are  performed,  yields 

<A:  (X  P  (X  T  (persuade  T  X2  (P  12)))).  </>. 

(X  S  (sons  X2  (vonan  X2)  S))> 

[<A:  (X  X  (go  X))>] 

=  <A:  (X  T  (persuade  T  X2  (go  X2))),  i>,  (X  S  (none  X2  (vonan  X2)  S))> 

After  the  past  operator  has  been  applied,  we  have 

<A:  (X  T  (pant  (persuade  T  X2  (go  X2)))),  <t>,  (X  S  (none  X2  (vonan  X2)  S))> 


®Note  that,  although  tb«  annotation  phase  wa«  deaeribcd  and  ia  implemented  procedurally,  the  process  actualljr  nsed 
guarantees  that  the  resulting  annotation  is  exactly  the  one  specified  deelaratively  by  the  annotation  rules. 


At  this  point,  the  pull  operator  (puil.v)  can  be  used  to  bring  the  quantifier  out  of  storage,  yielding* 
<1:  (X  T  (som  X2  (vonaa  X2)  (pant  (persuade  Y  X2  (go  12))))),  <j>,  <$>> 


This  will  ultimately  result  in  “a  woman”  getting  narrow  scope.  The  other  alternative  is  for  the  quantifier 
to  remain  in  storage,  to  be  pulled  only  at  the  full  sentence  level,  thus  resulting  in  the  other  scoping. 
In  Figure  2  we  have  added  the  translations  to  all  the  nodes  of  the  parse  tree.  Nodes  with  the  same 
translations  as  their  parents  were  left  unmarked.  After  examination  of  the  S  node  translations,  the 
original  sentence  is  given  the  fully  scoped  translations 

(every  12  (nan  X2) 

(som  XI  (sons  XI)  (past  (persuade  X2  XI  (go  XI))))) 
and 

(som  XI  (foiu  XI) 

(every  X2  (nan  X2)  (past  (persuade  X2  XI  (go  XI))))) 


3.3.3  A  Simple  Question- Answering  System 

As  mentioned  in  Section  3.1,  we  were  able  to  demonstrate  the  semantic  capabilities  of  our 
language  system  by  assembling  a  small  question-answering  system.  Our  strategy  was  to  first  translate 
English  into  logical  formulas  of  the  type  discussed  in  [Mo8l],  which  were  then  postprocessed  into  a  form 
suitable  for  a  first-order  deduction  system.  We  used  a  connection  graph  theorem  prover,  described  in 
Section  4  of  this  report.  (Another  possible  approach  would  have  been  to  translate  directly  into  first- 
order  logic,  or  to  develop  direct  proof  procedures  for  the  non-first-order  language.)  Thus,  we  were  able 
to  integrate  all  the  components  into  a  question-answering  system  by  providing  a  simple  control  structure 
that  accepted  an  input,  translated  it  into  logical  form,  reduced  the  translation  to  first-order  logic,  and 
then  either  asserted  the  translation  in  the  case  of  declarative  sentences  or  attempted  to  prove  it  in  the 
case  of  interrogatives.  (Only  yes/no  questions  have  been  implemented.) 

The  main  point  of  interest  is  that  our  question-answering  system  was  able  to  handle  complex 
semantic  entailments  involving  tense,  modality,  etc. — and  that,  moreover,  it  was  not  restricted  to 

•For  convenience,  when  »  final  constituent  of  a  translation  is  4  it  is  often  not  written.  Thus  we  could  have  written  <A: 

(X  Y  (some  ...)  ...)>  in  this  case. 
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(S:  <A:  (put  (persuade  XI  X2  (go  X2) ) ) ,  <t>, 

(X  S  (every  XI  (nan  XI)  S)) 

(X  S  (soae  X2  (voaaa  X2)  S))>, 

<A:  (soae  X2  (voaaa  X2)  (past  (persaade  XI  X2  (go  X2)))).  <f>, 

(X  S  (erery  XI  (aaa  XI)  S))> 

<A:  (erery  X2  (aaa  X2) 

(soae  XI  (soaaa  XI)  (past  (persaade  X2  XI  (go  X2)))))> 
<A:  (soae  XI  (voaaa  XI) 

(erery  X2  (aaa  X2)  (past  (persaade  X2  XI  (go  X2)))))> 

(SDEC 

(IP:  <A:  XI,  <t>,  (X  S  (erery  XI  (aaa  XI)  S))> 

(DETP:  <B:  (X  P  (X  S  (erery  X  (P  X)  S))). 

X> 

(DDET  (DET  erery))) 

(10M:  <A:  (X  X  (aaa  X))> 

(I0I1HD  (1001  (I  aaa))))) 

(PREDICATE:  <A:  (X  X  (past  (persaade  T  X2  (go  X2)))),  <t>. 

(X  S  (soae  X2  (voaaa  X2)  S))>, 

<A:  (X  X  (soae  X2  (voaaa  X2) 

(past  (persaade  T  X2  (go  12))))). 

<(>,  $> 

(A0XP:  <A:  (X  P  (X  X  (past  (P  X))))> 

(TE1SE  Apast) ) 

(TPP:  <A:  (X  T  (persaade  T  X2  (go  X2))).  <t>. 

(X  S  (soae  X2  (voaaa  X2)  S))> 

(TP  (TPT:  <A:  (X  X 

(X  P 

(X  T  (persaade  T  X  (P  T)))> 

(V  persaade))) 

(IP:  <A:  X2.  <t>.  (X  S  (soae  X2  (voaaa  X2)  S))> 

(DETP:  <B:  (X  P  (X  S  (soas  X  (P  X)  S))). 

X> 

(A  a)) 

(I0M:  <A:  (X  X  (voaaa  X))> 

(I0MHD  (1001  (I  voaaa))))) 

(IIFIIITITE  (TO:  aoae 
to) 

(TPP:  <A:  (X  X  (go  X))> 

(TP  (TPT  (T  go] 

Figure  2:  Node-by-node  translation  of  a  sample  sentence 


extensional  evaluation  in  a  database,  as  with  conventional  question-answering  systems.  For  example, 
our  system  was  able  to  handle  the  entailments  of  sentences  like 

•  John  could  not  have  been  persuaded  to  go. 

(The  transcript  of  a  sample  dialogue  is  included  as  Appendix  C.) 
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IIPUT:  every  nan  aaat  be  happy 
LF:  (every  Z  (aaa  Z) 

(necessary  (and  (happy  Z) 

(thing  Z))» 

FOL:  (every  x0172 

(inplies  (nan  REALVORLD  x0172) 

(every  v0173 

(inplies  (poss  BEALVOBLO  *0173) 

(and  (happy  *0173  x0172) 

(thing  *0173  X0172)))))) 

IIPOT:  bill  persuaded  John  to  go 

LF:  (past  (persuade  bill  John  (go  John))) 

FOL:  (soae  *0175 

(and  (past  *0175  REAL1QRLD) 

(soae  *0178 

(and  (persuade  *0175  bill  John  *0176) 

(go  *0176  John))))) 

Figure  9:  Translation  to  LF  and  Reduction  to  FOL 

The  reduction  of  logical  form  to  first-order  logic  (FOL)  was  parameterized  by  a  set  of  recur¬ 
sive  expansions  for  the  syntactic  elements  of  logical  form  in  a  manner  similar  to  Moore’s  use  of  an 
axiomatization  of  a  modal  language  of  belief.  [MoSOj  For  example,  (past  P)  is  expanded,  with  respect  to 
a  possible  world  w,  as 

(some  w2  (and  (past  w2  w)  <P,w2>))  , 


where  “<P,w2>”  denotes  the  recursive  FOL  reduction  of  P  relative  to  the  world  w2.  The  logical  form 
that  was  derived  for  the  sample  sentence  “John  went”  therefore  reduces  to  the  first-order  sentence 


(some  w  (and  (past  w  REALWORLD)  (go  w  John))) 

More  complicated  illustrations  of  the  results  of  translation  and  reduction  are  shown  in  Figure  3.  Note, 
for  example,  the  use  of  restricted  quantification  in  LF  and  ordinary  quantification  in  FOL. 

To  compute  the  correct  semantic  entailments,  the  deduction  system  was  preloaded  with  a  set  of 
meaning  postulates  (axioms)  giving  inferential  substance  to  the  predicates  associated  with  lexical  items 
(see  Appendix  B). 


3.4.  Further  Extensions 


We  are  continuing  to  refine  the  grammar  formalism  and  improve  the  implementation.  Some  of 
the  refinements  are  intended  to  make  the  annotations  and  translations  easier  to  write.  Examples  include 

•  Allowing  nonbinary  features,  along  with  sets  of  values,  in  the  annotations  and  guards  (extending  the 
language  to  include  equality  and  set  operations). 

•  Generalizing  the  language  used  to  specify  synthesis  of  logical  forms  and  developing  a  more  uniform 
treatment  of  translation  types. 

•  Generalizing  the  “gap”  variable  feature  to  handle  arbitrary  collections  of  designated  variables  by  using 
an  “environment”  mechanism.  This  is  useful  in  achieving  a  uniform  treatment  of  free  word  order  in 
verb  complements  and  modifiers. 

In  addition,  we  are  working  on  extensions  of  the  syntactic  machinery,  including  phrase-linking 
grammars  to  handle  displacement  phenomena  [PR81j,  and  methods  for  generating  the  augmented  phrase- 
structure  grammar  through  a  metarule  formalism  similar  to  that  of  [Ko80j.  We  have  also  experimented 
with  alternative  parsing  algorithms,  including  a  chart  parser  [BK79j  adapted  to  carry  out  annotation 
and  translation  in  the  manner  described  in  this  paper. 

Appendix  A.  Sample  Grammar  Rules 

The  following  is  a  portion  of  a  test  grammar  for  the  PATR  English  translation  system.  Only 
those  portions  of  the  grammar  utilized  in  analyzing  the  sample  sentences  in  the  text  were  included.  The 
full  grammar  handles  the  following  constructs:  modals,  adjectivals,  tense,  predicative  and  nonpredicative 
copulatives,  adverbials,  quantified  noun  phrases,  aspect,  NP,  PP,  and  infinitival  complements,  relative 
clauses,  yes/no  questions,  restricted  wh-questions,  noun-noun  compounds,  passives,  and  prepositional 
phrases  as  predicates  and  adjectivals. 

==========*=»  Gran  ST  Rules  sss*s*s»sass»ssssasa 

Constant  Ed*  =  curry  (LAMBDA  (X  T)  (equal  XT)) 

Constant  PASS’  = 

<A:  (LAMBDA  P  (LAMBDA  X  ((P  X)  T»).  MIL, 

(MK.MBD  (QUOTE  (LAMBDA  S  (none  7  (thing  T)  S))»  > 
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Constant  PASSIIF'  - 

<A:  (LAMBDA  P  (LAMBDA  I  (LAMBDA  X  (((P  X)  1)  1)))).  1IL 
(MX.MBD  (QUOTE  (LAMBDA  S  (sobs  T  (thing  T)  S)»)  > 

AUXP  ->  TE1SE; 

Translation: 

TEISE’ 

DDET  ->  DET; 

Annotation : 

[  Definite (DDET)  J 
Translation: 

DET' 

DETP  ->  A; 

Annotation : 

[  -■Definite (DETP)  ] 

Translation: 

A* 

DETP  ->  DDET; 

Annotation : 

[  AGREE (DETP,  DDET.  Definite)  ] 

Translation: 

DDET' 

IIFIIITITE  ->  TO  TPP; 

Annotation : 

[  AGREE (IMF IIITITE,  TPP.  Gappy.  Ih)  ] 

Translation: 

pnll.r(TPP') 

MOM  ->  I0MHD; 

I  Annotation : 

(  AGREE (MOM.  I0MHD.  Gappy)  ] 
j  Tranalatioa: 

I0HHD* 

I0MHD  ■>  MODI; 

Translation: 

1001' 

1001  ->  I; 

Translation: 

I’ 

IP  ->  DETP  I0M; 
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Annotation : 

[  AGREE (IP,  I0M,  Gappy)  1 
[  Predicative  (IP)  V  -Predicative (IP)  ] 

[  AGREE (IP.  DETP.  Definite)  ] 

Translation: 

-•PredicatiTe (IP) :  DETP*[I0M‘] 

Definite  (IP)  «  Predicative  (IP) :  EG*  [DETP*  [IOM'1] 
-•Definite  (IP)  A  Predicative (IP) :  I0M* 

PREDICATE  ->  ADIP  TPP; 

Annotation : 

[  AGREE (PREDICATE,  TPP.  Active,  Gappy,  It)  3 
Translation: 

pull.  v(AOXP’ [TPP*]) 

S  ->  SDEC; 

Annotation : 

[  -•Gappy (SDEC)  ] 

[  -It  (SDEC)  ] 

Translation: 

SDEC* 

SDEC  ->  IP  PREDICATE: 

Annotation : 

[  Gappy (IP)  V  Gappy (PREDICATE)  *  Gappy (SDEC)  ] 
[  -Predicative (IP)  ] 

[  It(IP)  V  It  (PREDICATE)  «*  It  (SDEC)  ] 

[  —(Gappy (IP)  A  Gappy (PREDICATE))  3 
Translation: 

pall. a (PREDICATE* [IP*]) 

TP  ->  TPT; 

Annotation : 

[  -Transitive (TPT)  ] 

[  -Takeslnf (TPT)  ] 

[  Active  (TPT)  ] 

[  Active (TP)  3 
Translation: 

TPT* 

TP  ->  TPT  IP  IIFIIITITE; 

Annotation : 

[  Takeslnf (TPT)  3 
[  Transitive (TPT)  3 
[  -Predicative (IP)  3 
[  AGREE (TP,  TPT,  Active)  3 
[  It  (IP)  V  It  (IIFIIITITE)  **  It  (TP)  ) 

[  IF  (Active  (TPT), 
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((Gappy  (IP)  V  Gappy  (IIP IIITITE) )  <=»  Gappy  (TP)). 

ft  -(Gappy (IP)  ft  Gappy (IIPIIITITE)). 

(-Gappy (TFT)  ft  Gappy (IP)))  ] 

Translation : 

Active  (TP) :  pnll. t(TPT’ [IP1] [IIPIIITITE’]) 
-Active(TP)  :  pnll.  t  (PAS  SI  IP’  tYPT*]  [IIPIIITITE’]) 

TPP  ->  TP; 

Annotation : 

[  AGREE (TPP.  TP,  Gappy.  1ft)  ] 

[  Ac tiro (TP)  ] 

Translation: 

?P. 

TPT  ->  T; 

Annotation : 

[  AGREE (TPT,  T,  Act Its,  TraasitlTO,  Takas Inf)  ] 
Translation: 

T’ 


Lexicon  == 


I  ->  aan; 

Translation : 

<A:  aan.  IIL,  IIL  > 

I  ->  woaan; 

Translation: 

<A:  woaan.  IIL,  IIL  > 

DET  ->  ewery; 

Annotation : 

[  Definite (DET)  ] 

Translation : 

<B:  (LAMBDA  P  (LAMBDA  S  (every  Z  (P  Z)  S))),  Z,  IIL  > 

A  ->  a; 

Translation: 

<B:  (LAMBDA  P  (LAMBDA  S  (some  Z  (P  Z)  S))).  Z,  IIL  > 

T  ->  persnade; 

Annotation : 

[  Transitive(T)  ] 

[  Active  (T)  V  -Active (T)  ] 

[  Takeslnf (T)  ] 

Translation: 

carry  (LAMBDA  (Z  P  T)  (persnade  T  Z  (P  Z))) 
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X  ->  go; 

Annotation : 

[  ->Transitive(¥)  1 
[  ->TakesInf  (?)  ] 

[  Active (V)  ] 

Translation: 

<A:  go,  IIL,  IIL  > 

TEISE  ->  hpast; 

Translation: 

carry  (LAMBDA  (P  X)  (past  (P  X))) 


Appendix  B.  Meaning  Postulates 

[every  a  (every  a  (iff  (past  *  a) 

(aot  (past  a  a] 

(every  a  (soae  a  (past  v  a))) 

[every  *  (every  x  (every  y  (every  z  (iaplies  (promise  «  x  y  z) 

(past  a  z] 

[every  a  (every  x  (every  y  (every  z  (implies  (persasde  a  x  y  z) 

(past  a  z] 

(every  a  (every  x  (thing  a  x))) 

[every  a  (every  x  (every  z  (implies  (want  a  x  z) 

(past  a  z] 

(every  a  (pose  a  a)) 

[every  a  (every  a  (iaplies  (past  a  a) 

(pose  a  a] 

[every  a  (every  a  (every  v  (iaplies  (sad  (pasti  a  a) 

(pastl  a  v)) 

(past2  a  v] 

[every  a  (every  z  (iaplies  (pasts  a  z) 

(past  a  z] 

[every  a  (every  z  (iff  (past  a  z) 

(pastl  a  z] 


Appendix  C.  Transcript  of  Sample  Dialogue 

»  John  is  happy 
OK. 

»  is  John  happy 

Tes. 
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»  is  jobs  &  happy  a as 
I  don't  know. 


»  John  is  a  nan 
OK. 

»  is  John  a  happy  nan 
Tos. 

»  no  nan  coaid  haws  hidden  a  book 
OK. 

»  did  John  hide  a  book 

lo. 

»  bill  hid  a  book 
OK. 

»  is  bill  a  nan 

lo. 

»  was  John  a  nan 
I  don't  know. 

»  eTery  nan  will  be  a  nan 


»  will  John  be  a  nan 
Tes. 

»  bill  persnaded  John  to  go 
OK. 

»  coaid  Joha  have  been  persaaded  to  go 
Tes. 

»  will  John  be  persaaded  to  go 
I  don't  know. 


29 


4.  A  Nonclausal  Connection-Graph  Resolution  Theorem-Proving  Program 


This  section  was  written  by  Mark  E.  Stickel. 

4.1.  Introduction 

This  section  describes  some  of  the  theory  and  features  of  a  nonclausal  connection-graph  resolu¬ 
tion  theorem-proving  program  being  developed  as  a  reasoning  component  for  KLAUS. 

The  most  important  characteristics  of  the  program  are 

•  Nonclausal  resolution  is  used  as  the  inference  system,  eliminating  some  of  the 

redundancy  and  unreadability  of  clause-based  systems. 

•  A  connection  graph  is  used  to  represent  permitted  resolution  operations,  restrict¬ 
ing  the  search  space  and  facilitating  the  use  of  graph  searching  for  efficient  deduc¬ 
tion, 

•  Heuristic  search  and  special  logical  connectives  are  used  for  program  control. 

The  following  sections  will  describe  these  aspects  of  the  program,  citing  disadvantages  and 
difficulties  as  well  as  advantages,  and  will  be  followed  by  a  description  of  the  implementation  status  of 
the  program  and  future  plans  for  it. 


4.2.  Nonclausal  Resolution 

One  of  the  most  widely  criticized  aspects  of  resolution  theorem  proving  is  its  use  of  clause  form 
for  wffs.  The  principal  criticisms  are 

•  Conversion  of  a  wff  to  clause  form  may  eliminate  pragmatically  useful  information 
encoded  in  the  choice  of  logical  connectives  (e.g.,  ->PVQ  may  suggest  case  analysis 
while  the  logically  equivalent  P  D  Q  may  suggest  chaining). 

•  Use  of  clause  form  may  result  in  a  large  number  of  clauses  being  needed  to 
represent  a  wff,  as  well  as  in  substantial  redundancy  in  the  resolution  search 


space. 


•  Clause  form  is  difficult  to  read  and  not  human-oriented. 

The  clausal  resolution  rule  can  be  easily  extended  to  general  quantifier-free  wffs  [Mu82,MW80j. 
Proofs  of  soundness  and  completeness  are  in  [Mu82|.  Where  clausal  resolution  resolves  on  clauses  con¬ 
taining  complementary  literals,  nonciausal  resolution  resolves  on  general  quantifier-free  wffs  containing 
atomic  wffs  (atoms)  occurring  with  opposite  polarity,  which  is  determined  by  the  parity  of  the  number 
of  explicit  or  implicit  negations  in  whose  scope  the  atom  appears  (positive  polarity  if  even,  negative 
polarity  if  odd).  In  clausal  resolution,  resolved-on  literals  are  deleted  and  remaining  literals  disjoined 
to  form  the  resolvent.  In  nonciausal  resolution,  all  occurrences  of  the  resolved-on  atom  are  replaced  by 
F= false  (T=true)  in  the  wff  in  which  it  occurs  positively  (negatively).  The  resulting  wffs  are  disjoined 
and  simplified  by  truth-functional  reductions  that  eliminate  embedded  occurrences  of  T  and  F  and 
optionally  perform  simplifications  such  as  A  A  A  -*  F. 

Definition  4.1.  If  A  and  B  are  ground  wffs  and  C  is  an  atom  occurring  positively  in  A  and  negatively 
in  B,  then  the  result  of  simplifying  A[C  «-  F)  V  B(C  *-  T),  where  Z(1/  «-  2)  is  the  result  of  replacing 
every  occurrence  of  y  in  Z  by  2,  is  a  ground  nonclautal  resolvent  of  A  and  B. 

It  is  clear  that  nonciausal  resolution  reduces  to  clausal  resolution  when  the  wffs  are  restricted 
to  be  clauses.  In  the  general  case,  however,  nonciausal  resolution  has  some  novel  characteristics  as 
compared  with  clausal  resolution.  It  is  possible  to  derive  more  than  one  resolvent  from  the  same  pair 
of  wffs,  even  resolving  on  the  same  atom,  if  the  atom  occurs  both  positively  and  negatively  in  both  wffs 
(e.g.,  atoms  within  the  scope  of  mi  equivalence  occur  both  positively  and  negatively).  Likewise,  it  is 
possible  to  resolve  a  wff  against  itself. 

The  ground  nonciausal  resolution  rule  can  be  lifted  to  nonground  wffs  by  renaming  parent  wffs 
apart  and  unifying  sets  of  atoms  from  each  parent,  one  atom  of  each  set  occurring  positively  in  the 
first  wff  and  negatively  in  the  second.  As  with  clausal  resolution,  only  single  atoms  need  be  resolved 
upon  if  the  resolution  operation  is  augmented  by  a  factorization  operation  that  derives  a  new  wff  by 
instantiating  a  wff  by  a  most  general  unifier  of  two  or  more  distinct  atoms  occurring  in  the  wff  (regardless 
of  polarity). 

A  nonciausal  resolution  derivation  of  F  from  a  set  of  wffs  demonstrates  the  unsatisfiability  of 
the  set  of  wffs.  Nonciausal  resolution  is  thus,  like  clausal  resolution,  a  refutation  procedure.  Variants 


of  the  procedure  that  attempt  to  affirm  rather  than  refute  a  wff  are  possible  (e.g.,  see  the  variety  of 
resolution  rules  in  (MW80j),  but  are  isomorphic  to  this  procedure. 


Although  clause  form  is  often  criticized,  use  of  nonclausal  form  has  the  disadvantage  that  most 
operations  on  nonclausal  form  are  more  complex  than  the  same  operations  on  clause  form.  The  result 
of  a  nonclausal  resolution  operation  is  less  predictable  than  the  result  of  a  clausal  resolution  operation. 
Clauses  can  be  represented  as  lists  of  literals;  sublists  are  appended  to  form  the  resolvent.  Pointers 
can  be  used  to  share  lists  of  literals  between  parent  and  resolvent  [BM72).  With  many  simplifications 
such  as  A  A  T  -*  A  and  A  A  ~<A  —  P  being  applied  during  the  formation  of  a  nonclausal  resolvent, 
the  appearance  of  a  resolvent  may  differ  substantially  from  its  parents,  making  structure  sharing  more 
difficult. 


For  most  forms  of  clausal  resolution,  an  atom  does  not  occur  more  than  once  in  a  clause.  In 
nonclausal  resolution,  an  atom  may  occur  any  number  of  times,  with  possibly  differing  polarity.  In 
clausal  resolution,  every  literal  in  the  clause  must  be  resolved  upon  for  the  clause  to  participate  in  a 
refutation.  Thus  if  a  clause  contains  a  literal  that  is  pure  (cannot  be  resolved  with  a  literal  in  any 
other  clause),  the  clause  can  be  deleted.  This  is  not  the  case  with  nonclausal  resolution;  not  all  atom 
occurrences  are  essential  in  the  sense  that  they  must  be  resolved  upon  to  participate  in  a  refutation.  For 
example,  {  P  A  Q,  ~>Q  }  is  a  minimally  inconsistent  set  of  wffs,  one  of  which  contains  the  pure  atom  P. 
A  more  complicated  definition  of  purity  involving  this  notion  of  essential  occurrences  must  be  used.  The 
subsumption  operation  must  also  be  redefined  for  nonclausal  resolution  to  take  account  of  such  facts  as 
the  subsumption  of  A  by  A  A  B  as  well  as  the  clausal  subsumption  of  A  V  B  by  A. 

[Mu82,MW80|  suggest  the  extension  of  nonclausal  resolution  to  resolving  on  nonatomic  sub  wffs 
of  pairs  of  wffs.  For  example,  P  V  Q  and  (P  V  <2)  D  R  could  be  resolved  to  obtain  R.  Resolving  on 
nonatoms  often  permits  significantly  shorter  and  more  readable  refutations.  However,  there  are  several 
reasons  for  not  doing  this: 

•  It  may  be  difficult  to  recognize  complementary  wffs.  For  example,  P  V  Q  occurs 
positively  in  Q  V  R  V  P  and  ~>P  D  Q. 

•  The  effect  of  resolving  a  pair  of  wffs  on  nonatomic  subwffs  can  be  achieved 
by  multiple  resolution  operations  on  atoms.  Resolution  on  both  atomic  and 
nonatomic  subwffs  could  result  in  redundant  derivations. 
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•  A  connection-graph  procedure  would  be  complicated  by  the  need  to  attach  links 
to  logical  subwffs  (e.g.,  PyQ  ia  QvRvP)  and  link  inheritance  would  be  further 
complicated  since  subwffs  of  a  resolvent  may  have  no  parent  subwffs  (e.g.,  when 
PVQ  and  ->PVR  are  resolved,  the  resolvent  QyR  is  a  subwff  of  neither  parent). 

Similar  complications  arise  if  equality  inferences  are  used  that  introduce  new 
structure  into  the  result. 

Although  the  nonciausal  resolution  rule  in  general  seems  adequate  as  compared  with  the  above 
proposed  extension  to  matching  on  nonatomic  subwffs,  the  handling  of  the  equivalence  relation  in  [Mu82] 
is  inadequate.  In  resolving  P  s  Q  and  (P  A  R)  V  ("’P  A  S),  it  is  possible  to  derive  QvS  and  ->Q  V  R, 
but  not  the  more  natural  result  of  simply  replacing  P  by  Q.  It  is  questionable  whether  handling  the 
equivalence  relation  in  nonciausal  resolution  without  further  extension  is  worthwhile  in  comparison  with 
the  representational  advantages  of  negation  normal  form  used  in  |An81,Bi8l].  Another  difficulty  with 
the  equivalence  relation  is  that  it  sometimes  needs  to  be  removed  during  Skolemization.  It  is  somewhat 
ineffective  to  provide  an  inference  system  which  handles  equivalence,  but  may  require  its  removal  during 
Skolemization.  [MW82]  provides  extensions  to  nonciausal  resolution  that  defer  Skolemization  and  permit 
equivalence  relations  to  be  retained  longer. 

4.3.  Connection  Graphs 

Connection-graph  resolution  was  introduced  in  (Ko75).  It  has  the  following  advantages: 

•  The  connection-graph  refinement  is  quite  restrictive.  Many  resolution  operations 
permitted  by  other  resolution  procedures  are  not  permitted  by  connection-graph 
resolution. 

•  The  links  associated  with  each  wff  function  partially  as  indexing  of  the  wffs.  Effort 
is  not  wasted  in  the  theorem  prover  examining  the  entire  set  of  wffs  for  wffs  that 

can  be  resolved  against  newly  derived  wffs. 

•  Links  can  be  traversed  by  a  graph-searching  algorithm  whereby  each  link  traver¬ 
sal  denotes  a  resolution  operation.  This  can  be  done  to  plan  a  deduction  without 
actually  constructing  it.  This  graph  searching  may  resemble  the  searching  per¬ 
formed  for  deduction  in  knowledge  representation  languages. 


Connection-graph  resolution  is  extended  in  a  natural  way  to  use  the  nonclausal  resolution 
inference  rule. 

A  connection  graph  is  a  set  of  wffs  and  a  set  of  links  that  connect  atoms  occurring  with  positive 
polarity  in  one  wff  and  negative  polarity  in  the  same  or  another  wff.  Performing  the  nonclausal  resolution 
operation  indicated  by  the  link  results  in  the  production  of  a  new  connection  graph  with  the  resolved 
upon  link  eliminated  and  the  nonclausal  resolvent  added.  Roughly  speaking,  atoms  of  the  nonclausal 
resolvent  are  linked  only  to  to  atoms  to  which  atoms  of  the  parent  wffs  were  linked. 

Definition  4.2.  Let  S  be  a  set  of  ground  wffs.  Let  L  be 

{ (C,  A,  B)  |  A,  B  €  S ,  atom  C  occurs  positively  in  A  and  negatively  in  B  } 

Then  (S,L)  is  the  full  connection  graph  for  S. 

Definition  4.3.  Let  S  be  a  set  of  ground  wffs  and  L  be  its  connection  graph.  Let  l  —  (C,  A,  B)  be  an 
element  of  L  and  C  be  the  nonclausal  resolvent  A(C  «-  F)  V  B(C  *-  T).  Let  S’  be  S  U  {  C  }.  Let  L’  be 

U{  (E,  C,  D)  |  atom  E  occurs  positively  in  C  and 
{ E,A,D)eLor[E,B,D)eL } 

U{  (E,D,  C )  |  atom  E  occurs  negatively  in  C  and 
( E,D,A)elor[E,D,B)eL } 

U{  (E,  C,  C)  |  atom  E  occurs  positively  and  negatively  in  C  and 

{E,  A,  A)  €  L,  ( E ,  B,  B)  €  L,  {E,  A,  B)eL,  or  {E,  B,A)eL  }. 

Then  the  connection  graph  {S’,  L’)  is  derived  from  (S,  L)  by  ground  nonclausal  connection-graph  resolu¬ 
tion. 

A  nonclausal  connection-graph  resolution  refutation  of  an  input  set  of  wffs  is  a  derivation  of  a 
set  of  wffs  including  F  by  nonclausal  connection-graph  resolution  from  the  full  connection  graph  of  the 
input  set  of  wffs. 

Ground  nonclausal  connection-graph  resolution  can  be  extended  to  the  nonground  case  by 
including  in  the  links  the  unifier  of  the  atoms  they  connect,  keeping  wffs  renamed  apart,  and  by  including 
links  between  variants  of  the  same  wff  (to  allow  a  wff  to  directly  or  indirectly  resolve  against  a  variant 


of  itself).  Factorization  must  also  be  included.  Either  factors  with  appropriately  inherited  links  must 
be  added  for  each  wff  in  the  connection  graph  or  special  factor  links  can  be  used  with  link  inheritance 
rules  for  both  resolve  and  factor  links  after  resolution  and  factorization  operations. 

The  nonclausal  connection-graph  resolution  procedure  is  sound  and  there  is  reason  to  believe  it 
is  complete.  However,  it  has  not  yet  been  proved  to  be  complete,  and  the  history  of  proving  completeness 
of  connection-graph  procedures  for  the  simpler  clausal  case  (see  (Bi8l))  suggests  it  may  be  difficult. 

One  reason  it  is  difficult  to  prove  the  completeness  of  the  connection-graph  procedure  is  that  the 
link  inheritance  rules  exclude  some  links  that  would  be  present  if  the  connection  graph  were  merely  an 
encoding  of  all  permitted  resolution  operations  for  ordinary  resolution.  Exactly  which  links  are  excluded 
depends  on  the  order  in  which  resolution  operations  are  performed.  The  effect  of  connection-graph 
resolution  is  to  impose  the  following  restriction:  if  a  pair  of  atoms  in  a  pair  of  wffs  is  resolved  upon, 
atoms  derived  (in  later  resolution  operations)  from  the  resolved-on  atoms  cannot  be  resolved  against 
each  other.  An  example  of  this  (and  a  reason  for  the  power  of  this  restriction)  is  that,  if  a  set  of  wffs 
includes  P  V  Q  and  -<P  V  ~>Q,  these  two  wffs  can  be  resolved  upon  P  and  Q — resulting  in  tautologies 
that  are  discarded;  after  that,  neither  wff  can  be  resolved  with  an  atom  descended  from  the  other,  even 
though  doing  so  would  not  result  in  a  tautology. 

Connection-graph  resolution  procedures  can  possibly  be  incomplete  by  succeeding  in  finding 
refutations  when  links  are  resolved  upon  in  some  orders,  but  not  others.  For  example,  consider  the 
combination  of  linear  resolution  and  connection-graph  resolution  for  clauses.  Each  is  complete,  but  the 
combination  is  not.  If  linear  connection-graph  resolution  is  applied  to  {  P  V  ~'Q,  ~>P  V  ~>Q,  Q  }  with  Q 
as  top  clause,  depth-first  search  will  find  a  refutation,  but  breadth-first  search  will  not.  This  contrasts 
with  the  usual  situation  in  which  breadth-first  search  is  “safe”,  always  guaranteed  to  find  a  refutation 
if  there  is  one.  To  see  that  it  fails  in  this  case,  observe  that  after  P  and  ->P  are  generated  on  the  first 
level  of  breadth-first  search,  Q  and  -'Q  have  no  links — and  thus  none  of  the  three  input  clauses  can  be 
further  resolved  upon  to  lead  to  a  refutation.  P  and  -»P  are  linked,  but  cannot  be  resolved  without 
violating  the  linear-resolution  restriction. 

A  set  of  assertions  in  a  connection  graph  can  to  some  extent  be  regarded  and  treated  as  a 
semantic  network — more  so  than  the  same  set  of  assertions  without  the  connection  graph. 
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For  example,  the  full  connection  graph  for 


clephant(Clyde) 
elcphant{x )  D  mammal(x) 
elephant(y)  D  color(y,  gray) 
mammal(z)  D  animal(z) 

would  contain  links  between  the  following  pairs  of  atoms 

(x.  (elephant(Clyde),  elephant(x)) 
i3.  {elephont(Cldye),  elephant(y )) 

/s.  {mammal(x),mammal(z)). 

Answers  to  such  queries  as  “What  color  is  Clyde?”  and  “Is  Clyde  an  animal?”  can  be  found  by  graph 
searching  with  minimal  analysis  of  the  assertions,  by  traversing  the  links  in  the  connection  graph.  Such 
searching  can  be  made  more  efficient  by  labeling  the  links  (e.g.,  it  a  for  ti  and  ts,  hat  color  tor  t3). 
The  semantic  content  of  the  set  of  assertions  is  still  conveyed  by  the  assertions  themselves,  but  control 
information  is  provided  to  a  graph-searching  procedure  by  the  link  labels. 

Similar  comments  could  be  made  regarding  any  logical  representation.  However,  the  use  of  a 
connection  graph  in  which  all  permissible  remaining  resolution  operations  are  encoded  in  explicit  links 
can  yield  greater  efficiency  by  eliminating  traversal  of  multiple  paths  to  the  same  goal.  For  example, 
suppose  l3  is  resolved  upon,  resulting  in  the  added  assertion 

elephant(w)  D  animal{w) 


and  the  added  link 

l4.  ( elephant(Clyde),elephant(w )). 

The  link  l3  is  deleted.  There  is  still  only  one  path  or  proof  that  Clyde  is  an  animal,  since  the  absence 
of  ts  blocks  the  path  or  proof  elephant(Clyde )  -*•  mammal(Clyde)  -*  animal(Clyde). 

Graph  searching  in  the  connection  graph  to  determine  taxonomic  relations  quickly  is  a  simple 
illustration  of  the  more  general  notion,  extensively  explored  in  [An81,Si76|,  of  using  graph  searching 
to  determine  the  existence  of  refutations.  The  ideas  and  techniques  developed  there  are  applicable 
to  nonclausal  connection-graph  resolution.  Connection-graph  resolution  appears  to  offer  the  following 
advantages  over  these  other  schemes: 
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•  Although  graph  searching  can  be  done  in  the  connection-graph  resolution  pro¬ 
cedure,  [An81,Si76|  do  not  allow  for  the  actual  formation  of  resolvents.  If  their 
techniques  for  graph  search  were  adopted  as  a  device  for  planning  or  quick  refuta¬ 
tion,  connection-graph  resolution  could  be  regarded  as  a  superset  of  these  other 
methods. 

•  The  actual  formation  of  resolvents  and  the  resulting  change  in  the  connection 
graph  are  useful  for  retaining  information  during  a  refutation,  as  well  as  for 
conveying  information  (about  usage  of  wife,  etc.)  from  one  refutation  or  assertion 
to  the  next.  (Here  it  is  assumed  that  the  theorem  prover  is  being  used  with  an 
assertional  database  to  which  queries  are  posed  and  assertions  occasionally  added 
and  deleted,  as  opposed  to  the  usual  situation  in  theorem  proving  in  which  there 
is  no  persistent  assertional  database,  ail  axioms  being  presented  anew  for  each 
proof.) 

•  Connection-graph  resolution  provides  a  convenient,  albeit  unsophisticated,  means 
of  interleaving  matching  complementary  literals  and  adding  new  instances  of 
assertions  (if  more  than  one  ground  instance  of  a  wff  is  required),  as  compared 
with  the  separate  processes  of  searching  for  a  mating,  and  quantifier  duplication 
if  the  search  fails  |An8I]. 

Of  course,  the  argument  in  favor  of  performing  only  graph  searching -as  in  [An81,  Si76]  is 
that  forming  resolvents  is  expensive  compared  to  traversing  links,  and  the  cost  of  creating  and  storing 
inherited  links  may  be  high. 

A  good  system  will  probably  have  a  mixture  of  resolution  and  graph  searching,  as  in  [BE8l| 
for  clausal  connection-graph  resolution.  Graph  searching  is  used  in  that  system  for  look-ahead  and  to 
determine  if  a  refutation  exists  within  a  certain  number  of  steps.  Simple  graph  searching  is  used  (e.g.,  not 
looking  for  refutations  in  which  wffs  occur  more  than  once),  with  the  full  complexity  and  completeness 
of  connection-graph  resolution  in  the  background — ready  to  attempt  a  refutation. 

One  problem  with  graph  searching  to  find  refutations  is  in  assessing  the  effectiveness  of  the 
procedure.  In  ordinary  resolution  theorem  proving,  effectiveness  can  be  evaluated  in  part  by  examining 
the  number  of  clauses  generated,  retained,  used  in  the  refutation,  and  so  forth.  (BE81)  states  “Within 


37 


this  frame  of  reference  it  would  be  easy  to  design  the  ‘perfect’  proof  procedure:  the  supervisor  and  the 
look-ahead  heuristics  would  find  the  proof  and  then  guide  the  system  without  any  unnecessary  steps 
through  the  search  space.”  The  amount  of  time  used  is  a  good  measure  for  such  a  program,  but  should 
not  be  used  to  compare  programs  as  there  may  be  differences  in  the  machines  the  programs  run  on  and 
in  the  efficiency  of  the  programs  themselves  (as  opposed  to  the  algorithms).  In  general,  as  (BG81)  states, 
a  measure  incorporating  both  total  time  and  space  will  be  required,  adding  the  further  complication  of 
evaluating  time-space  trade-offs. 


4.4.  Control 

A  link  scheduler  is  used  to  specify  a  refutation  search  strategy.  When  an  assertion  is  added  by 
the  user,  it  is  linked  in  the  connection  graph  to  all  previous  assertions.  When  a  resolvent  is  added,  it  is 
linked  to  the  other  assertions  according  to  the  link  inheritance  rules.  All  such  added  links  are  examined 
by  the  link  scheduler.  Three  outcomes  are  possible: 

•  The  link  is  deleted.  For  example,  analysis  may  show  that  resolving  on  the  link 
would  create  a  tautology  or  pure  wff  that  could  not  be  used  in  a  refutation, 
whereupon  the  link  can  be  deleted. 

•  The  link  is  retained,  but  not  scheduled.  Thus  the  link  can  be  inherited,  but 
cannot  be  resolved  upon  (though  its  descendants  might  be).  This  is  done  when 
combining  connection-graph  resolution  with  other  refinements  of  resolution,  such 
as  ordering  restrictions  and  the  special  logical-connective  restrictions  described 
below. 

•  The  link  is  scheduled.  It  is  given  a  numerical  score  and  placed  in  the  link  schedule. 

The  theorem  prover  operates  by  repeatedly  resolving  on  the  best  scored  link  in 
the  schedule,  creating  the  resolvent,  and  scheduling  the  added  links. 

Scheduling  of  the  links  is  done  after  all  the  new  links  have  been  added,  so  that  the  link  scheduler 
can  act  on  such  important  facts  as  the  number  of  links  attached  to  an  atom. 

Special  logical  connectives  can  be  used  to  impose  restrictions  on  the  use  of  particular  assertions. 
As  in  [M08O],  the  following  connectives  denote  the  following  procedural  interpretations  ot  A  D  B  : 

•  A  -*  B.  If  literal  A  is  ever  asserted,  assert  B  (forward  chaining). 
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•  B  *-  A.  To  prove  literal  3,  try  to  prove  A  (backward  chaining).  Since  a  refutation 
procedure  is  being  used,  this  is  interpreted  as  ‘‘permit  the  resolution,  on  literal 


A,  between  A  D  B  and  any  wff  having  support  and  containing  A”. 

•  A  **  B.  If  literal  A  is  ever  asserted,  also  assert  B  and,  to  prove  ->A,  try  to  prove 

-<B. 

•  B  «=  A.  To  prove  literal  3,  try  to  prove  A  and,  if  ->3  is  ever  asserted,  also  assert 
->A. 

•  A  D  B  and  -'A  V  B.  Unrestricted  and  equivalent. 

The  use  of  both  nonclausal  resolution  and  these  special  logical  connectives  gives  this  program 
some  resemblance  to  natural  deduction  [B177].  It  represents  an  intermediate  point  between  clausal 
resolution  and  natural  deduction,  with  advantages  of  each.  It  differs  from  natural  deduction,  since,  for 
example,  a  backward-chaining  application  of  A  D  3  to  C  would  result  in  -<A  V  C(B  «—  T)  rather  than 
C(B  <-  A)  (with  perhaps  only  a  single  instance  of  3  replaced,  requiring  additional  operations  to  replace 
the  other  occurrences).  The  latter  expression  may  be  more  natural,  but  the  former  is  more  concise 
because  all  occurrences  of  3  are  eliminated  and  only  a  single  instance  of  A  is  added.  Heuristic  search  is 
used  in  a  manner  similar  to  the  way  it  is  employed  in  a  clausal  system  [St74j  and  in  a  natural-deduction 
system  [Ty8l], 


4.5.  Implement ation  Status  and  Future  Plans 

The  theorem-proving  program  is  implemented  as  a  4000+  line  INTERLISP  program  and  is 
presently  being  used  as  the  deduction  component  of  MICROKLAUS.  Natural-language  assertions  and 
queries  are  translated  by  the  DIALOGIC  system  (described  in  Section  2)  into  logical  form.  This  logical 
form  is  further  translated  into  predicate  calculus  for  input  to  the  theorem  prover.  The  allowance  for 
predicate  variables  extends  the  program  slightly  beyond  ordinary  first-order  predicate  calculus.  Future 
work  will  expand  the  range  of  logical  form  handled,  as  not  all  logical  forms  that  can  be  generated  by 
DIALOGIC  are  presently  being  translated;  the  range  of  logical  form  generated  by  DIALOGIC  is  also  being 
expanded).  The  program  was  also  used  as  the  deduction  component  of  the  simple  question-answering 
system  described  in  Section  3.3.3. 

Besides  the  unification  filtering  provided  by  the  connection  graph,  atoms  in  assertions  are 
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indexed  by  predicate  symbol  so  as  to  speed  the  addition  to  the  connection  graph  of  user  input  assertions 
when  there  is  a  large  number  of  them.  Wffs  are  also  indexed  by  their  propositional  structure  and  predicate 
symbols  to  speed  checking  for  alphabetic-commutative  variants  to  be  eliminated.  More  efficient  indexing 
schemes  will  probably  be  tried  and  variant  elimination  replaced  by  subsumption. 

Factorization  has  not  yet  been  implemented  in  the  program.  When  two  wffs  are  resolved  upon 
a  pair  of  atoms,  all  atoms  instantiated  to  be  the  same  as  the  instantiated  resolved-on  atoms  are  replaced 
by  F  or  T,  but  there  is  no  effort  to  force  additional  atoms,  by  further  instantiation,  to  be  the  same  as 
the  resolved-on  atoms.  Thus,  only  “obvious”  factors  are  used.  This  is  incomplete,  but  effective.  Factor 
links  will  be  added  for  completeness. 

So  far,  only  fairly  simple  evaluation  functions  have  been  used  in  the  search  control  process. 
They  are  similar  to  those  used  in  |St74],  being  weighted  sums  of  the  deduction  depth  of  the  wff  (a 
measure  of  the  effort  required  to  derive  the  wff)  and  the  number  of  atoms  in  the  wff  (a  measure  of  the 
additional  effort  that  will  be  needed  to  complete  a  refutation  using  the  wff).  Performance  is  generally 
superior  to  that  in  [St74|.  In  ordering  restrictions,  atoms  are  also  evaluated  according  to  how  many 
links  are  connected  to  them,  so  that  atoms  with  fewer  links  can  be  resolved  upon  preferentially.  Not 
only  is  the  immediate  branching  factor  reduced,  bnt  there  is  also  the  p:  spect  that  the  other  atoms 
with  more  links  will  be  instantiated  and  inherit  fewer  links  when  the  resolution  operation  is  performed. 
Interestingly,  as  was  also  noted  in  [StT4|,  there  can  be  negative  interactions  among  individually  good 
ideas  on  search  control.  For  example,  a  strong  length-preference  strategy  and  the  strategy  of  resolving  on 
an  atom  with  the  fewest  links  are  somewhat  inconsistent.  When  there  are  many  assertions  about  some 
predicate — some  short  and  specific,  others  long  and  general — the  atom  with  the  fewest  links  is  likely  to  be 
linked  only  to  long  and  general  assertions.  Resolving  on  it  thus  may  result  in  long  resolvents  that  would 
be  given  low  preference  by  a  strong-length  preference  strategy.  More  work  will  be  done  on  developing 
good  evaluation  functions.  The  most  important  extension  of  the  program  will  be  further  development 
of  connection-graph  searching — both  to  provide  input  to  evaluation  functions  and  to  perform  entire 
deductions  without  creating  any  resolvents. 
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